Unsupervised Ontology Population Using Latent Semantic Analysis
نویسندگان
چکیده
A large ontology such as lexical ontology is useful as the basic knowledge base in artificial intelligence and computational linguistics application. However, it is insufficient to recognize only existing instances for each concept. Adding new instances into the lexical ontology will expand knowledge in the system. In this paper, we propose an efficient unsupervised ontology population system that classifies new instances into a corresponding lexical ontology concept. Compared to previous related works, it does not require manual preprocessing to prepare training data. In terms of processing time, it does not need to search for many concepts in the lexical ontology. Our system employs latent semantic analysis together with context voting to find the appropriate concept of the instance. In sum, the system achieves higher accuracy when the lexical ontology contains a lot of concepts, which generally occurs in practical problems.
منابع مشابه
Presentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures
Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...
متن کاملPresentation of an efficient automatic short answer grading model based on combination of pseudo relevance feedback and semantic relatedness measures
Automatic short answer grading (ASAG) is the automated process of assessing answers based on natural language using computation methods and machine learning algorithms. Development of large-scale smart education systems on one hand and the importance of assessment as a key factor in the learning process and its confronted challenges, on the other hand, have significantly increased the need for ...
متن کاملCombining Rule-Based Methods and Latent Semantic Analysis for Ontology Structure Construction
We report on a pilot study of techniques for organizing terms derived from relevant corpora into semantic clusters. We combine rule-based techniques and an unsupervised statistical method to semantically cluster terms extracted from medical reports. The rule-based approach identifies lexicosyntactic patterns suggestive of parent-child relations and coordination structures suggestive of sibling-...
متن کاملUnsupervised structured semantic inference for spoken dialog reservation tasks
This work proposes a generative model to infer latent semantic structures on top of manual speech transcriptions in a spoken dialog reservation task. The proposed model is akin to a standard semantic role labeling system, except that it is unsupervised, it does not rely on any syntactic information and it exploits concepts derived from a domain-specific ontology. The semantic structure is obtai...
متن کاملTerminological ontology learning and population using latent Dirichlet allocation
The success of Semantic Web will heavily rely on the availability of formal ontologies to structure machine understanding data. However, there is still a lack of general methodologies for ontology automatic learning and population, i.e. the generation of domain ontologies from various kinds of resources by applying natural language processing and machine learning techniques In this paper, the a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010